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£N| Abstract. Deviation of ergodic sums is studied for substitution 

dynamical systems with a matrix that admits eigenvalues of mod- 
^ ulus 1. The functions 7 we consider are the corresponding eigen- 

^"5 functions. In Theorem |1.1| we prove that the limit inferior of the 

ergodic sums (n,j(xo) + . . . + j(x n -i)) ne f{ is bounded for every 
point x in the phase space. In Theorem |1.2| we prove existence of 
limit distributions along certain exponential subsequences of times 
for substitutions of constant length. Under additional assumptions, 
we prove that ergodic integrals satisfy the Central Limit Theorem 
(Theorem [Ol Theorem [L9 1 . 
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1. Introduction 

Let a be a primtive substitution over a finite alphabet A, let M a be 
the matrix substitution, and let X a be the corresponding subshift in 
the space of bi-infinite sequences (the precise definitions are recalled in 
Section [2J . The aim of this paper is to study the asymptotic behaviour 
of ergodic sums for the minimal dynamical system (X a , T) in the (non- 
hyperbolic) case when the matrix M a has an eigenvalue of modulus 
one. For a function / : X a — > K, all x G X a and all n6N, set 

n-l 

S n f{x) = Y,f(T k x). 

k=0 

In this paper, we shall only consider functions / depending on the first 
coordinate of the symbolic sequence. In what follows, we will identify 
such a function / with the corresponding vector in C*- 4 . 

1.1. Limit inferior of ergodic sums. 
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Theorem 1.1. Let a be a primitive substitution with matrix M a . As- 
sume that M a has an eigenvalue 82 of modulus 1 and let V = (7(a), a G 
A)* be a right eigenvector associated to 62- There exists a constant C 
(depending only on a and 7 ) such that, for any x G X a , we have 

(1.1) liminf \S nl (x)\ < C and liminf \S nl (T~ n x)\ < C. 

Recall that, since T is minimal, by the Gottschalk-Hedlund Theorem, 
the function 7 is a coboundary if and only if the ergodic sums S n r y(x) 
remain bounded, as ti — >■ 00, for some and, consequently, for every 
x G X a . Adamczewski |Ad] proved that, if 6 2 is not a root of unity, 
then for almost all x, we have 

(1.2) limsup |5' n 7(x)| = 00. 

If 62 is a root of unity, then Adamczewski |Ad] gives a necessary and 
sufficient combinatorial condition ensuring that 7 is a coboundary. For 



almost all points the conclusion of Theorem IT is true for any ergodic 
transformation. It is important for us to have the result for all points. 
Observe that there exist infinitely many primitive substitutions with 
a hyperbolic matrix with the following property (see jBHMj ) . If the 
function V is in the unstable space of the matrix or equivalently if there 
exists x such that 

limsup I Snj (x) I = 00 
then there exists x such that 

liminf | S n r y(x) \ = 00 and liminf |5' n 7(T~ n x)| = 00 

n— >oo n— >oo 

Below, we give further applications of Theor em |1.1| to afline interval 



exchange transformations (Corollaries 1.5 and 1.6). 



1.2. Limit distributions for substitutions of constant length. 

Informally, the mechanism of the Central Limit Theorem for substitu- 
tions with eigenvalue 1 can be formulated as follows. A long orbit of 
a substitution dynamical system admits a representation as a disjoint 
union of segments of different exponential scales. Since the eigenvalue 
is equal to 1, these segments give roughly the same contribution. By 
the Markov property, these segments are asymptotically independent, 
whence the Central Limit theorem. 

We are only able to make this argument precise under significant ad- 
ditional restrictions. 

Recall that a substitution a has constant length d if the image of each 
letter has length d. For instance, this condition implies that the maxi- 
mal eigenvalue of the matrix M a is d. Furthermore, we assume that 1 
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is an eigenvalue and consider the eigenvector 7, 7M CT = 7. For a given 
t G [0, 1], we consider ergodic sums S^tp**] (7) as a sequence of random 
variables where the initial point is distributed according to the unique 
shift invariant measure on X a . The behaviour of these sequences sig- 
nificantly depends on the expansion of t in basis d; namely on the 
sequence r t = (r„)„ eN G {0, . . . , d — 1} N such that t = ^2 n<1 T n d n . We 
first consider the case when this expansion is eventually periodic. 

Theorem 1.2. Let a be a primitive substitution of constant length d. 
Assume that M a has eigenvalue 1 with eigenvector 7 . Let t G [0, 1] and 
assume that the expansion r t = (r n )„ g N of t in basis d is eventually 
periodic. Then, 

1 r, 1 \ distr 

n 



where v is a probability measure with generalized density 

Moreover, the limit is trivial if and only if 7 is a coboundary. 

The nature of the limit law depends on the expansion of t in basis d. 
In Section [8] we give examples showing that the limit distribution need 
not be Gaussian and can have atoms. 

In Section [7] we provide a precise algorithmically verifiable sufficient 
condition on the morphism a under which the limit distribution is in- 
deed Gaussian for almost all t. We thus arrive at the following theorem. 

Theorem 1.3. There exist infinitely many primitive substitutions o 
of constant length with eigenvalue 1 and eigenvector 7 such that, for 
Lebesgue almost every t G [0, 1], the following holds: 

(1) there exist positive constants a\, a.2 such that the variance V n = 
Var (S\_d n t\ (7)) satisfies the inequality 

ojin 4 / 5 < V n < a 2 n. 

(2) The sequence 

--S[dnt] (7) 



converges in distribution to the normal law N(0, 1). 

Our argument is an adaptation to our context of R. L. Dobrushin's 
central limit theorem |Dob] for non homogeneous Markov chains. We 
follow the martingale approach of S. Sethuraman and S.R.S. Varadhan 
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Conjecture 1.4. 

(1) For any primitive substitution of constant length with eigenvalue 
1, there exists t such that the limit distribution of 



is not Gaussian. 

(2) For any primitive substitution of constant length with eigenvalue 
1, for almost every t, 



converges to the normal law iV(0, 1). 

1.2.1. Affine interval exchange transformations. Our study was partly 
motivated by questions arising in the study of so-called Affine Inter- 
val Exchange Transformations. An interval exchange transformation is 
defined by the length A = (Ai, . . . , A r ) of the intervals !]_, . . . , I r and a 
permutation ir of {1, . . . , r}. It is denoted by Tn l7r )- An affine interval 
exchange transformation is a piecewise affine bijective map of the inter- 
val. As for interval exchange transformation it is defined by the length 
of the intervals, a permutation, and the slopes of the map (constant 
on each interval). This additional information is encoded in a vector 
(u>i, . . . , w r ) with Wi > for i = 1, . . . , r. Given these data, one can 
determine {aj, i = 1, . . . , r} such that for all z = 1, . . . , r and all x G Jj, 



To an affine interval exchange transformation T, we assign the corre- 
sponding log-slope vector 



A natural idea to analyze this class of dynamical systems has been to 
ask whether they are semi-conjugate to some interval exchange trans- 
formation (as a circle diffeomorphism is semi-conjugate to a rotation). 
A subsequent question is to ask if such a semi-conjugacy is a con- 
jugacy. Indeed, it is not always the case : affine interval exchange 
transformation may have wandering intervals. It is well-known that 
the problem of conjugacy reduces to estimates on the ergodic sums 
of interval exchange transformations (see for instance |CG| . [C] , |LMj . 
[BHMJ, [MMY2J). Together with this observation, our result yields 
new classes of examples corresponding to self-similar interval exchange 
transformation with eigenvalues of modulus one. 




1 




T(x) = WiX + ai. 
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Let T be an affine interval exchange transformation. One can assign to 
T its Rauzy expansion in the same way as for usual interval exchange 
transformations [ MMY2j . Let us consider the case where the Rauzy 
expansion of T is periodic with positive Rauzy renormalization matrix 
R. In this case T admits a semi-conjugacy onto the underlying self- 
similar interval exchange transformation T . Let (Ai,...A n ) be the 
lengths of T and as before, let (wi, . . . , w n ) be the slopes of T. Camelier 
and Gutierrez |CG] showed furthermore that 

n 

(1.3) ^2XilogWi = 0. 

i 

Conversely, given a self-similar interval exchange transformation To 



and a vector of slopes (wi, . . . ,w n ) satisfying (1.3) then Camelier and 
Gutierrez showed that there exists an affine interval exchange transfor- 
mation T with slopes (wi, . . . , w n ) semi-conjugate to T . 
We next consider the question of conjugation. Consider the decompo- 
sition of C n into characteristic spaces of the matrix t R: 

C n = E U ®E C @E S . 

Here E u corresponds to eigenvalues with absolute value greater than 
1, E s to eigenvalues with absolute value less than 1, E c to eigenval- 
ues on the unit circle. If T e E s , then Camelier and Gutierrez |CG] 
showed that T is conjugate to T . If T 6 E u , Camelier and Gutierrez 
|CG] and Cobo [C] gave examples in which T has wandering inter- 
vals. In pBHM] , it is showed that if T e E u and Galois conjugate to 
the Perron-Frobenius vector of t R then there is no conjugacy. Marmi- 
Moussa-Yoccoz |MMY2] obtained the same conclusion for generic in- 
terval exchange transformation ( |MMY2] ) using the positivity of the 
Lyapunov exponent of the Kontsevich-Zorich cocycle (proved by Forni 
P§!, Avila-Viana [SVj ). 



Theorem 1.1 implies 



Corollary 1.5. Let Tq = Tr\,w) be a self-similar interval exchange 
transformation and R the associated matrix obtained by Rauzy induc- 
tion. Assume that R has an eigenvalue 9 of modulus 1. 

• If 9 — ±1, let r be an eigenvector associated to 9 for t R 

• If 9 = belongs to C\R, let (resp. V-$) be an eigenvector 
associated to 9 (resp. to e~ 1 ^) for t R. Let Y be a vector with 
real entries belonging to the vector space generated by and 
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If an affine interval exchange transformation with vector of the loga- 
rithm of the slopes is Y is semi- conjugate to T then it is also topolog- 
ical^ conjugated to T . 

This corollary applies to many examples: 

Corollary 1.6. There exist infinitely many self similar interval ex- 
change transformation Tn^) on r > 4 intervals such that every affine 
interval exchange transformation that is semi- conjugate to Tr\,n) is con- 
jugate to T (A)7r) . 

We recall |Ve2] that the Veech group T(X) of a translation surface X 
is a discrete subgroup of SL 2 (M) obtained as the image by derivation of 
the group of affine diffeomorphisms of the translation surface X. The 
surface X is a Veech surface if its Veech group is a lattice in SL 2 (IR). 
An element of V is the derivative of a pseudo-Anosov diffeomorphism 
if and only if it is a hyperbolic element. A Veech surface is primitive 
if it is not a cover of any other translation surface. Assume that X is 
a primitive Veech surface of genus 2. The group T(X) is then defined 
over a quadratic field K ( |Mc] ) . We have the following proposition: 

Proposition 1.7. On each primitive Veech surface in genus 2, there 
exists a pseudo-Anosov diffeomorphism whose dilatation has two con- 
jugates of modulus one. 

Remark. In other words, the dilatation of our pseudo-Anosov diffeo- 
morphism is a Salem number. 



To a pseudo-Anosov diffeomorphism given by Proposition 1.7, assign 
an interval exchange transformation using Veech's zippered rectangle 
construction. The resulting self-similar interval exchange transforma- 
tion To then has the desired property: any affine interval exchange 
transformation T semi-conjugate to T is in fact conjugate to T . In- 
deed the unstable space is spanned by the Perron-Frobenius vector and 



by the result of Camelier-Gutierrez (see equation (1.3)), the vector of 
slopes cannot belong to the unstable space. If the vector of slopes be- 
longs to the stable space, then again results of Camelier and Gutierrez 
imply the existence of a conjugacy. If the vector of slopes belongs to 



the central space then Corollary |1.5 applies and again yields the ex- 



istence of a conjugacy. Others examples can be obtained using the 
square tiled surfaces with degenerate Lyapunov spectrum constructed 
by Forni |Fo2] and Forni-Matheus-Zorich |FMZj . 

1.2.2. Symbolic flows and deviation of ergodic averages. To prove The- 



orem |1.2| we introduce a family of Markov chains which provide ap- 
proximations of the ergodic sums. To give more specific statements 
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—but without entering now into details of the proof — let us consider, 
for each N G N and r G {0, . . . , d N }, the Markov chain X N > T ° defined 
on the state space A x A 2 x {0, . . . , d N } having probability transitions: 



Consider the map G T0 (a,V,m) = j(S) + j(P') ', we say that G TQ is 
a coboundary if it remains bounded on all paths of the Markov chain 
X N ' T °. 

Corollary 1.8. Assume that the chain X N,T0 only possesses one recur- 
rent component and set to = X)n>i T od~ nN ■ If is unbounded, then, 
as n oo ; the sequence of random variables S^ n toil/V^ converges in 
distribution to a normal law with positive variance. 

We prove by analyzing examples that the irreducibility assumption does 
not always hold and that the behaviour of the fluctuations at exponen- 
tial scale can be very complicated. For instance, we exhibit examples 
of substitutions a with an invariant vector 7 that is not a coboundary 
such that Sd«t{l) is bounded for some values of t. The combinatorial 
properties of the graphs of the corresponding Markov chains remain 
very unclear. Nevertheless, using results on non stationary Markov 
chains, we can prove that if there is to with a periodic expansion such 
that the associated Markov chain is aperiodic with positive variance, 
then fluctuations are normal for Lebesgue almost every t G [0, 1]. More 
specifically, we prove the following Theorem. 

Theorem 1.9. Let a be a primitive substitution of constant length with 
eigenvalue 1 . Let 7 be an eigenvector associated with the eigenvalue 1 . 
Assume that there exists N and r such that the Markov chain X N,T ° 
is recurrent aperiodic and the function G TQ is not a coboundary. Then, 
for Lebesgue almost every t G [0, 1], the following holds: 

(1) there exist positive constants a.\, a 2 such that the variance V n = 
Var(S^t\ (7)) satisfies the inequality 




d~ N if a N (a) = PbS with \P\ = m 

and a N (V) = P'WS' with \P'\ = m + r 



otherwise. 



ain 



4/5 <V n < a 2 n. 



(2) 



The sequence 




converges in distribution to the normal law N(0, 1). 
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The sequence V n is the variance of a non stationary Markov chain. The 
assumption (positivity of the variance for a specific tq) means that 
for the homogeneous Markov chain associated to to — Yl n >i T od~ , 
the approximation of the ergodic sums of 7 on this chain is not a 
coboundary. It is not very explicit but it is possible to check it on 



examples. We prove that the hypothesis of Theorem |1.9| is satisfied in 
many cases. 

1.3. Historical remarks. For deviations of ergodic averages of generic 
interval exchange transformations see |Zoj . |Foj . For holonomy flows of 
pseudo-Anosov diffeomorphisms for which the second eigenvalue 62 of 
the corresponding action in homology is real and satisfies the inequal- 
ity 62 > 1, limit theorems are obtained in [Bu2j. In this case, limit 
distributions have compact support. 

Let a be a primitive substitution over a finite alphabet A and 7 be a 
function from A to C. Let X a C A z be the subshift defined from a and 
x be a point in X a . The stepped line associated to x is the piecewise 
affine curve £(x) in K x C with vertices (n,j(x ) + . . . + 7(x„-i))neN 
and (n, 7(x__ n ) + . . . +7(x_i))„>i. Stepped lines were studied by many 
authors (see for instance |ArIt] and |ABB] and references there). They 
are closely related to the famous Rauzy fractals and to the fractal 
curves studied by Dumont and Thomas in |DTlj . |DT2j . Using a dif- 
ferent language, the result of Adamczewski |Ad] about discrepancy of 
substitutive systems describes the behaviour of 

(1.4) limsup(|7(x ) + . . . + 7WI) 

in terms of the eigenvalues of the matrix M a of a. 

1.4. Organization of the paper. Section [2] sets our notation on 



words, sequences, substitutions and suspension flows. Theorem |1.1 
is proved in Section 



while its corollaries |1.5| and |1.6| are proved in 
Section |4j In Section 5, we prove preliminary results about approxima- 
tion of ergodic sums by Markov chains. More precisely, we construct 
an explicit family of finite automata which allow us to code the ergodic 
sums as sums of nonhomogeneous Markov random variables. The main 
difficulty is that the underlying Markov chain may fail to be irreducible 



(examples are given in Section pi). In Section^ we prove Theorem 1.2 



In Section [7] we prove Theorem L9 In Section [8] we discuss various 



examples including those proving Theorem |1.3| (see Remark 8.7). 
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2. Background 

2.1. Words and sequences. Let A be a finite set. One calls it an 
alphabet and its elements symbols. A word is a finite sequence of sym- 
bols in A, w = w . . .wi_i. The length of w is denoted \w\ = £. One 
also defines the empty word e. The set of words in the alphabet A 
is denoted A* and A + = A* \ {e}. We will need to consider words 
indexed by integer numbers, that is, w = W- m . . . W-i.wo . . . we where 
f,m£N and the dot separates negative and non-negative coordinates. 
The set of one-sided infinite sequences x = (xi)^ in A is denoted by 
A N . Analogously, A z is the set of two-sided infinite sequences x = 

Given a sequence x in A + , A N or A z one denotes x[i,j] the subword of 
x appearing between indexes i and j. Similarly one defines x(—oc,i] 
and x[i,oo). Let w = u>_ m . . . W-i.Wq . . . We be a word on A. One 
defines the cylinder set [w] as {x G A z : x[—m,£] = w}. 
The shift map T : A z —> A z or T : A N —> A N is given by T(x) = 
for x = (xi)i e N- A subshift is any shift invariant and closed 
(for the product topology) subset of A z or A N . A subshift is minimal 
if all of its orbits by the shift are dense. 

In what follows we will use the shift map in several contexts, often in 
restriction to a subshift. To simplify notation we keep the symbol T 
all the time. 

2.2. Substitutions. We refer to |Qu| and [F] and references therein 
for the general theory of substitutions. 

A substitution is a map o : A — > A + . It naturally extends to A + , A N 
and A 7 ", for x = (xj)j 6 2 G A z the extension (which is a morphism of 
monoid) is given by 



cr(x) = . . . o-(x^ 2 )o-(x^ 1 ).a(x )o-(x 1 ) 
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where the central dot separates negative and non-negative coordinates 
of x. A further natural convention is that the image of the empty word 
e is e. 

Let M be the matrix with indices in A such that M a b is the number 
of times letter b appears in a (a) for any a, b G A. The substitution is 
primitive if there is N > such that for any a G A, a N (a) contains any 
other letter of A (here a N means N consecutive iterations of a) . Under 
primitivity one can assume without loss of generality that M > 0. 
A substitution a is of constant length d if, for all a G A, \o~(a)\ = d. 
Let X a C A z be the subshift defined from a. That is, x G X a if and 
only if any subword of x is a subword of a (a) for some N G N and 
a G A. 

Assume a is primitive. Given a point x G X a there exists a unique 
sequence (p iy q, Si) ie ^ G (A* xix A*) n such that for each i G N: 
a(c m ) = PiCjSi and 

• • • 0- 3 (j53)o- 2 (j9 2 )o- 1 (pi)po-CoSoO- 1 (si)o- 2 (s2)cr 3 (s 3 ) • • • 

is the central part of x, where the dot separates negative and non- 
negative coordinates. This sequence is called the prefix-suffix decom- 
position of x (see for instance |CSj ). 

If only finitely many suffixes Sj are nonempty, then there exists a G A 
and non-negative integers i and q such that 

x + = x[0, oo) = c s cr 1 (s 1 ) . . . o- e (si) lim a nq (a) 

Analogously, if only finitely many pi are non empty, then 

x- = x(-oo, -1] = ( lim a np (b))a m {p m ) . . . a 1 (p 1 )p 

n— >co 

for some b G A and non-negative integers p and m. 

2.3. Vershik automorphisms and suspension flows. In this sub- 
section, we recall the construction of Vershik's automorphisms |Verj . 
|VLj and their continuous analogues [Itoj . |Buj . We refer to Section 2, 
3, 5 of |Bu] for details and further references. Given an oriented graph 
T with m vertices, let S(T) be the set of edges of T. For e G £(T) 
we denote 1(e) its initial vertex and F(e) its terminal vertex. To the 
graph T we assign a non-negative m x m non-negative matrix A(T) by 
the formula 

(2.1) A = A(Y\ j = ${e G £(T) : /(e) = i, F(e) = j} 

We assume that A is a primitive matrix (in other words, that the 
corresponding topological Markov chain is irreducible and aperiodic). 
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We define the Markov compactum: 

Y = {y = Vx ■ ■ -Vn ■ • ■ ■ Vn e £(T), F(y n+1 ) = I(y n )} 

The shift on Y is denoted by &. Assume that there is an order on the 
set of edges starting from a given vertex. This partial order extends to 
a partial order on Y: we write y < y' if there exists I G N such that 
yi < y[ and y n = y' n for n > I. The Vershik automorphism T Y is the 
map from Y to itself defined by 

1 y = mmy . 

y'>y 

As A is primitive, there is a unique probability measure invariant under 
T Y denoted by /iy. 

We now define a suspension flow over (Y, T Y ) in the following way: let 
H be the Perron-Frobenius eigenvector of A. Then h t is the special 
flow over (Y, T Y ) with roof function r(y) = hi( yi ). The phase space of 
the flow is 

Y(r) = {(y,t):yeY,0<t<r(y)}. 

The measure /xy induces a probability measure z/p on Y T . The vector 
H is normalized in such a way that the space Y(t) have total measure 
1. 

For each e G £(r), the set 

{(y, t) : y G Y, y x = e, < t < hj^)} 
is called a rectangle in what follows. 

The space A = {x = . . . x_ n . . . x n ■ ■ ■ : x n G 5(r), F(x n+ i) = /(x n )} is 
the natural extension of (Y, S). The space A endowed with the Parry 
measure (the measure of maximal entropy) is canonically isomorphic 
to(Y(r)^ r ). 

Following Livshits |Liv] , we now connect Vershik's automorphisms with 
substitution dynamical systems. Consider the alphabet A = {1, . . . , m} 
as the set of vertices of T. For all a G A, we denote a (a) the sequence 
{F(e) : 1(e) = a} ordered with the partial order on {e : /(e) = a}. 
The dynamical system (X a ,T) is a topological factor of the Vershik au- 
tomorphism (Y,T Y ). The semi-conjugacy is given by the prefix-suffix 
decomposition. Almost every u G X a can be written in the form 

u = ■ ■ ■ a n (P n ) ■ ■ ■ cr(P 1 )P .a 5 cT(P 1 ) • • • a n (P n ) ■■■ , 

where, for all n G N, a(a n+ i) = P n a n S n . Thus, it is clear that such 
point correspond to the unique path in Y such that, for all n > 0, 
a n = F(y n+1 ) (and = I(y n ) for n > 0) and y n+ i has exactly \P n \ 
predecessors in the partial order around I(y n+ i). We recall that the 
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semi-conjugacy is not always a topological conjugacy because there 
may be multiple writings but it is a measurable conjugacy. 
When a has constant length, the roof function of the suspension flow 
is constant because *(1, . . . , 1) is an eigenvector for A associated to the 
maximal eigenvalue. 

We observe that the matrix M a of the substitution is connected to the 
matrix A by the relation M a = A T . 

3. Proof of Theorem 11.11 

Without loss of generality, we assume that M is a positive matrix. Let 
x G X a , 9 be an eigenvalue of M of modulus one and V = (7(a), a G A) 
an eigenvector associated to 9. We give a proof for 

lim inf | ( 7 (x ) + . . . + 7 W) I < C. 

n— >oo 

The other estimate is quite similar to obtain. 

The prefix-suffix sequence (pi, q, Si)^ of x is a path in a finite automa- 
ton (the prefix-suffix automaton, see |CS] ). Therefore, the prefixes and 
suffixes belong to a finite set V x S depending only on a. 

First Case We first assume that infinitely many suffixes are non empty 
in the prefix-suffix decomposition of x. Let k be a positive integer, by 
the prefix-suffix decomposition, we have 

X = ... <J k+l {pk+l)<7 n {Pn)v k ~ X {Pk-l) ■ ■ • O- 1 (Pl)P0-C0S O" 1 (si) 

...a k -\s k ^)a k {s k )a k+ \s k+l )... 

We denote by P k the word 0" fc_1 (pfc_i) . . . <J l (pi)pa and by S k the word 
CoSoc 1 (si) . . . o" A ''~ 1 (s fc _ 1 ). Thus we have 

x = ... a k+1 ( Pk+1 )a k (p k )P k .S k a k (s k )a k+1 (s k+1 ) . . . 

and 

CT k (c k ) = P k .S k . 

Assume that s k+ i is non empty. Then a(s k +i) contains c k because 
every letter appears in the image of every letter. We consider the first 
occurrence of c k in a(s k+ i) and get 

cr(s fc+ i) = rifeCfcSfe, 

where U k and H k are words of bounded length (the bound only de- 
pends on a). Consequently S k a k (s k )a k (Jl k )a k (c k )a k (T, k ) is a prefix of 

x + = x Xi . . . x n This word is equal to S k a k (s k )a k (Il k )P k S k a k (T, k ). 

Thus, the word W k = S k a k {s k )a k (Jl k )P k is a prefix of x + . 
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We extend the function 7 to the monoid A* by the natural manner. 



A* C 

t 

7 



w 1 ...w t ^ y^^jwj). 



w 

i=l 

It is clear that the extension of 7 is a morphism of monoid: if v and w 
are two words j(vw) = j(v ) + 7(10). 
The key lemma is the following 

Lemma 3.1. The sequence { , y{Wk))km is bounded by a constant that 
only depends on a and 7. 

Proof. By the hypothesis on 9 and 7, for every w G A* and m G N, 

7 (a m (w)) = 6 m -f{w). 

This implies that 

h(a m (w))\ = \i(w)\. 
We use this remark to calculate "i(W k ) = j(Sk) + k ^(s k ) + # fc 7(IT fc ) + 

7(A) = i(s k P k ) + 6^(3,) + # fc 7 (n fc ). 

Thus, 7 (W fc ) = 7(^n fc ) + fc ( 7 (s fc ) + 7 (n fc )) = 7(^ fc (c fe )) + 9 k (>y(s k ) + 

1 (u k )) = e k ( 1 ( Ck ) + 1 (s k ) + 1 (u k )). 

This yields that 

h(w k )\ < | 7 ( Cfc )| + |7( Sfc )| + |7(n fc )|. 

The length of the words c k , s k , U k are bounded independently on x. 
Thus, each factor on the right hand side of the inequality is bounded 
and the sequence (|7(Wfc)|)fc 6 N is bounded independently on x. 

□ 



Now we end the proof of Theorem |1.1| in the first case. We already 
proved that (W k ) k( z^ is a sequence of prefixes of x + . By hypothesis, 
the length of W k tends to infinity (because infinitely many suffixes s n 



are non empty). By Lemma 3.1 the sequence (iTC^fe) DfceN is bounded 



independently on x. This proves Theorem 1.1 in the first case 



Second Case If the suffixes of x are eventually empty, it means that 
x + belongs to the negative orbit of a periodic point for a. The proof 
is similar to the previous one. There exists a G A and non-negative 
integers I and q such that 

x + = c Sq(7 1 (si) . . . cr e (se) lim a nq (a). 
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Let Se = CoSocr 1 (si) . . . <J e (se); by construction of the prefix-suffix au- 
tomaton, there exists a letter c and a finite word Pe such that c/ +1 (c) = 
PeSi, thus 

x = ...P e .Se lim a nq {a). 

Since the letter c appears in the word a (a), there exists words of 
bounded length II and £ such that a (a) = Hc£. Thus for each n > £+2, 
S e a nq {a) = S e a nq - e - 2 cx e+1 {UcJ:) is a prefix of x + . Moreover 

S l a nq - l - 2 (a l+l (Ji))a nq - i - 2 (a l+ \c)) = S £ a n ^~ V +1 (n)K 9 ~^ 2 (P^). 

Set W n = S e a nq ~ £ - 2 (a e+1 (n))a nq - e - 2 (P e ); it is a prefix of x + . 
Now j(W n ) = 7 (5«) + 7 (n) +j(P £ ) = 7(^ +1 (c)) +7(n) = 7 (c) 
The absolute value of this quantity is bounded. The bound only de- 
pends on a and 7 since c is a letter and II belongs to a finite set (prefixes 
of cr(a)). 

4. Proof of Corollaries 11.51 and 11.61 



4.1. Proof of Corollary 1.5, Let T = Tn-x) De a self-similar interval 

exchange transformation on r intervals and T = (71, . . . ,7 r ) a vector 

orthogonal to A. The set of affine interval exchange transformations 

semi-conjugated to T( A)7r ) with slopes w = (ej, . . . , ej?) is never empty 

(see |CGj . [C]); this set is denoted by S a ff(T,w). 

Cobo gave an if and only if condition ensuring the existence of / in 

S a ff(T,w) which is not conjugated to T (see [C] page 392-393). This 

condition is the following: 

there exists a point x G X a such that 

(4.1) e l{x - n - x - l] + Y e" 7 ^ -^- 1 ) < 00. 

n>l n>l 

Otherwise every element of S a ff(T, w) is conjugated to T by a homeo- 
morphism. 



Theorem 1 1.1 1 says that, if T is an eigenvector associated to an eigenvalue 
of modulus one, then for every x G X a the sequence (7(2:0 • • • #n-i))neN 
possesses a bounded subsequence. Therefore, if 9 = ±1, 

e 7(a:_„...a;-i) _|_ g -7(a:o...x„_i) 
n>l n>l 

diverges for every x G .X^. 

If ^ 6 C \ 1, by assumption, there exist complex numbers a and 
b such that T = aV^ + frVl^. We apply Theorem |1.1| to the se- 
quences (V^Xq • • • x n _i))„ eN and (y_^,(x . . . a; n _i))„ eN . It is clear in 
the proof of Theorem 1 1 . 1 1 1 hat one can find a subsequence (n^) such that 
(V^Xq . . . x nfc _i)) fceN and (V-$(x . . . x nfc _i)) fceN are simultaneously bounded. 
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ADBC 
DCAB 



ADBC 
DCBA 



ABCD 
DBAC 



D 



ABCD 
DACB 



ABDC 
DACB 



Figure 1. Rauzy diagram. 



Corollary 1.5 



Thus equation (4.1) cannot be fulfilled which completes the proof of 



4.2. Proof of Corollary \1.G\ From Theorem it is enough to give 
an infinite family of self-similar interval exchanges transformations on 
4 intervals that have eigenvalues of modulus 1. The examples are ob- 
tained in the most simple Rauzy class (see figure [l]). 
We recall that a Rauzy class is the closure of a given pair of permu- 
tations under Rauzy induction and that self similar interval exchange 
transformations correspond to loops in the Rauzy diagram. Marmi- 
Moussa-Yoccoz gave an if and only if condition ensuring that a loop is 
realized by a self similar interval exchange transformation. In our situa- 
tion, the Marmi-Moussa-Yoccoz criterion says that a loop is admissible 
if all the labels A, B, C, D appear along the loop (see |MMY1| ). This 
condition will be fulfilled by the family of examples studied here. 
Every arrow of the diagram induces a substitution. Given an interval 
exchange transformation T on an interval /, let T' be its image under 
Rauzy induction acting on The substitution is defined by following 
the images of elements of the partition /' in the partition / under T 
until they come back to The product of these substitutions gives 
the substitution corresponding to the loop or to the self similar interval 
exchange transformation associated to the loop. Thus a loop is labelled 
by a finite word on the alphabet {A, B, C, D}. 
The symmetric permutation is 

ABCD 
DCBA 
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Proposition 4.1. For every n > 1, the matrix M n of the path starting 
from the symmetric permutation labelled by the loop DBBDC n DAAA 
has the following properties: 

(1) The Perron- Frobenius eigenvalue 8^ is an algebraic integer of 
degree 4- 



(2) It has two conjugate of modulus 1, = 8^' (it is a Salem 



number). 



(3) if 8 



(n) 
2 



n , then a n is an irrational number. 



Proof. The substitution on the alphabet {1,2,3,4} is 

14 

14224 

14(23)" +1 24 
14(23)"24. 



a,, 




The corresponding matrix is 

/l 




V 



1 

2 

2 



1 

n + 2 
n + 1 
2 



1 \ 

n + 1 
n 
2 



The characteristic polynomial of M n is 

p n = X A - (6 + n)X 3 + (10 + n)X 2 



(6 + n)X + l. 



By the general theory of interval exchange transformations (see |Yo] 
|Zo2] ). P n is a reciprocal polynomial. Thus, the roots 81,82, 83, 84 of P n 
satisfy #4 = l/9i, 83 = 1/6*2 and 61 > |0 2 | > l^l > 6*4 (as n is given, 
the dependence on n is omitted in the sequel). 

We first prove that P n has two complex roots of modulus 1. Let t = 
81 + I/61 and s = 8 2 + 1/62- We remark that 82 and 83 are complex 
conjugate if and only if — 2 < s < 2. In fact, if 82 is real and larger 
than 1, s > 2; if # 2 has modulus 1, 8 2 = e 2ma and s = 2cos(27ra). 
A simple calculation yields the equations t + s = 6 + n and ts = 8 + n. 

Thus, S - 6+n-Vn 2 +8n+4 

(-2,2) 

Now, we prove that P n is irreducible. Otherwise, 82 is ±1 or an alge- 
braic integer of degree 2. A direct computation proves that ±1 are not 
roots of P n . Moreover, the only irreducible polynomials of degree 2, 
with roots of modulus 1 are the cyclotomic polynomials X 2 + X + 1, 
X 2 + 1, X 2 — X + 1 (the polynomial has the form X 2 — sX + 1 with 
— 2 < s < 2). To prove that P n is irreducible, we check that it is not a 
multiple of X 2 + X + 1, X 2 + 1, X 2 - X + 1. 



. We easily check that this quantity belongs to 
independently on n. Thus 8 2 has modulus 1. 
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We end the proof by showing that a is irrational. If a is rational it 
means that 62 is a root of unity. This is impossible since Q\ > 1 is a 
conjugate of 62- 

□ 



4.3. Proof of Proposition 1.7[ Denote by r the non-trivial Galois 



automorphism of K and by T(X) the Veech group of X. Given A 
a b \ c r :+„ n„i„; n ^ - M G ) r (°) 



c ^ G T, its Galois conjugate is r(A) = y T ^Jj r{d)j ' denote 

by T' the image of T(X) under Galois conjugacy. Let be an affine 
diffeomorphism of X and A the corresponding element of the Veech 

group, its action on Hi(X, R) is by ^ r(A)J * n a su ^ a ^^ e basis. 

The group P is a non discrete and non elementary subgroup of SL^M). 
By a result of Beardon [BeJ, it contains an elliptic element of infinite 
order. Let A' = t(A") be this element. If A" is hyperbolic, then 



Proposition 1.7 follows. We now show that A" cannot be elliptic nor 
parabolic. As T(X) is discrete, every elliptic element ofT(X) is of finite 
order. But A' = t(A") is an element of infinite order. Consequently, 
A" is not an elliptic element. Now the image of a parabolic element 
under Galois conjugacy is again parabolic (since an element of SL 2 (M) is 
parabolic if and only if the absolute value of its trace is 2), so A" cannot 
be parabolic. Therefore A" is indeed hyperbolic and is the derivative 
of a pseudo-Anosov diffeomorphism with the desired properties. 



5. Asymptotic behaviour of ergodic sums 

5.1. Ergodic means. We are interested in the distribution of the er- 
godic integral 

/ / h s (x)ds 
Jo 

considered as a random variable on the probability space (X, up) and 
normalized to have variance 1. 

The first observation is that for all n G Z, the law of x is the same 
as the law of & n (x) by (5-invariance of the measure u-p (we mean: the 
law of Id is the same as the law of & n ). Hence the distribution of the 
ergodic integral is the same as that of the sequence 

/ foh s (e- n (x))ds. 

Jo 



Let us state this result more formally: 
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Lemma 5.1. For any measurable function <p : Y — > R 7 all z G R and 
all m G Z, 

u({x G Y : <p{x) < 2}) = u({x G K : (p(& m x) < z}). 

From here till the end of the paper a is a substitution of constant 
length d, thus Ai = d. We choose a function / constant on rectangles 



(defined in Subsection 2.3). We are interested in the case when this 
function "corresponds" to the coordinates of an eigenvector associated 
with eigenvalue 1, i.e. an invariant vector (ie f(a(a)) = f(a) for every 
letter a). 

5.2. Decomposition of unstable leaves. Pieces of unstable leaves 
on which we compute the ergodic averages cross transversally the "rect- 
angles" of the partition. To such a piece of unstable leave we associate 
a symbolic sequence given by the names of the successive rectangles it 
does cross. The self similar structure implies that these symbolic se- 
quences are words in the language of the substitution a. More precisely, 
such a piece of leaf starts inside a rectangle (the initial rectangle) then 
crosses transversally a given number of rectangles, to end up inside a 
last rectangle (the final rectangle). Here, we try to describe the sym- 
bolic sequence using the self-similar structure. We have to take care of 
the initial and final rectangles which are not completely crossed. For 
(important) technical reasons we distinguish not only the final but the 
last two rectangles. 

Let x G Y and t G [1; 00). We consider a piece jf of unstable leave de- 
termined by its starting point x and its length t. The sequence (x„)„ e z 
determines a bi-infinite sequence of prefixes/center/ suffixes (P%, c*, S%) 
satisfying for all n G Z, F(x n ) = c x n and cr(c^ +1 ) = P^c^S*- We observe 
that around x the symbolic sequence (sequence of codes of rectangles 
seen along the unstable leave through x) is (note that this decomposi- 
tion stops if all suffixes are empty) 

= • • ■ a n (PZ) ■ ■ ■ a(PnP x .c x SSa(Sf) ■ ■ ■ a n (S x n ) 

We compute the distance from x to the boundary of the rectangle in 
which it is by t~ = ^2 n<0 d n \S%\. 

We decompose (in basis d) : t = J2 n <M & nj n an d let k t = [t\ — 1 = 
J2n=o d n T n — 1. We observe that jt(%) crosses at least k t rectangles. 
It does cross one more only if J2n<od n \P%\ + J2n<od nr n > 1 (which 
happens only if there is H such that for all h < H, \Pl h \ + T-h = d — 1, 
and \P1 H \ +r_ H > d). 

We denote K — K t the greater integer such that d K < k t . We ob- 
serve that £ x = \SQ<r(Sf) ■ ■ ■ cr K (Sf c )\ < k t . We also observe that 
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there is a letter W such that \S$<r(Sf) ■ ■ ■ a K (S x K )a K+ \W)\ > k t and 
that S%a(Sf)---a K (S x K )a K+1 {w) is a factor of u x . We decompose 
the prefix of length kt — £ x of a +1 (W) into prefixes in such way 



that Eto^fl + Eto^l^l = k and [Ito UIk 



1 ' " W fcr 

Then, we set t + = t — k — t . We have to check if whether t + < 1 or 
1 < t + < 2. We use the identity f(a(a)) = f(a) to get, if t + < 1, the 
following equation: 



(5.1) 



and, if 1 < t 
(5.2) 



/ o h s (x)ds = f(a )t' 
<2, 



K 

E 

71=1 



ft K 1 

/ /ofc,(x)ds = f(a )t-+J2f(S X n)+J2 f(P:> t )+f(*k) + (t + -l)f(a k+1 ). 

^° n=l n=K 

It is important to notice that except for the boundary terms, the de- 
composition depends on x through (x„) n >o and of t through [t\ . The 
other data tell about what happens more precisely in the boundary 
terms. We get 



(5.3) 



r t ( K 1 

^° \n=l n=K 



< 3 



If we step to d n t and shift the starting point to a~ n (x), we will ob- 
tain the same decomposition and something more precise on boundary 
terms. Hence, to get recursively to the next step we only have to keep 
track of eg and the word dk t ak t +i- To encode this information, we define 
a family of automata which is described in the next paragraph. 



5.3. A family of automata. The following family of automata asso- 
ciated with the substitution a will allow us approximate the ergodic 
averages in a Markovian way. 

For all W G A and all integer 1 < m < d, we decompose a(W) = 

pW,m c W,m S W,m w j th pW,m & Jrn-l ^ c W,m & A &nd gW.m & J^d-m 

For all r G {0, . . . , d — 1} we define an automaton A T . States are 
couples (a, W) E Ax A 2 and edges are labelled by a real number v and 
an integer m. For all v G R, m G {1, . . . , d} and integers and a, 6, G A, 
V,W G A 2 we put a labelled arrow 



(a, V) ^ (6, W) 
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if b = c a ' m , W = c v,m+r c v,m+T+i and y = ni (e sa , m + epv.m+r). Here the 
word u{V) has length 2c?, thus it has a prefix of length m + r. Observe 
that in particular, <r(a) = p a > m bS a ' m so that the outgoing degree d 
of any vertex is the same as the outgoing degree of the corresponding 
vertex in the prefix/suffix automaton. Note that for r = we can forget 
the second letter of V and W; indeed since m < d, c v ' m is determined 
by the first letter of V. 

Examples of such automata are described in Section [8] The readers 
should refer to this section to understand concrete examples. 

5.4. Ergodic sums and automata. We fix i G [^',d) and write t = 
J2 n <o Tn d n (note that r > 0). We let W be a word of length d i.e. 
W G A d and choose x G Y such that uj x G [W], i.e. Uq ■ ■ ■ u)%_i = W or 
again F(x )F((T Y x)o) ■ ■ ■ F((Ty~ 1 x) ) = W. The sequence (x n ) ne z de- 
termines a bi-infinite sequence of prefixes/centres/suffixes (P*,c^,S*) 
satisfying for all n G Z, F(x n ) = & n and a(c^ +1 ) = P%c*S*. 
We observe that around x the symbolic sequence (sequence of codes 
of rectangles seen along the unstable leave through x) is (except if all 
prefixes are empty) 



We note that Cq = Wq. In the simpler case, W is the prefix of length 
d of c^SQ<j{Sf) ; or of CgSgcr*(Sf ) where Sf is the first non empty 
suffix after Sq. If all suffixes are empty, then we must look at Ty(x). 
Anyhow, we set a := eg = W and V° := W T0 W T0+ i. We also set 
U° — Wi ■ ■ ■ Wro-i (may be empty, if r = 1). 

We construct recursively a sequence (a n , V n , U n ) n > . Assume (a n , V n , U n ) 
determined for some n. Let then m be the length m = |Pf/ n+1 J + 1(= 



Remark 5.2. T/iis sequence (a n ,V n ) is obtained by the family of au- 
tomata described in the previous section. The sequence of automata 
depends on the digits oft. 



Since / depends only on the first coordinate, for brevity we write f{W) 
for the sum f(W x ) + ■■■ + f{W k ) if W = W x ■ ■ ■ W k . We set t n _ = 
dn E k <-n dk \ S -k\ and t n + = d n J2 k< _ n d k (r k + \P%\) and observe that 
< tl < 2. 



= ° n (P X n) ■ ■ ■ v{P?)Plc*S*o{St) ■ ■ ■ a n (S:) 



d- \S : 



(n+l) 



|). We set 
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Lemma 5.3. Ift'1 < 1, then 

"d n t 

f o h s (&- n x)ds = f(a n )t n _ + f(U n ) + tlf(V?). 
Ift n + >1, then 

[ dt fo h s {&- n x)ds = f{a n )t n _ + f{U n ) + f(V?) + (tl - l)f{V 2 n ). 
Jo 



In particular, we have 

r-d n t 



foh s (e- n x)ds-f{u r 







< 3 



Proof. The symbolic decomposition around & m (x) is 



OO 



J m+n) 



Hence for m = —n, recalling that u x G [W], we obtain (looking at the 
right hand part) 



c x _ n Sl n a(Sl n+1 ) ■ ■ ■ a^iS^iW, ■ ■ ■ W t 



By construction, U n is a prefix of this word (shifted once). It re- 
mains to compute its length. We observe that \U°\ = r — 1. But 



U 



n+l\ 



\s x 



(n+l)l 



+ d\U n \ + rr n+1) + m - 1 = d\U n \ + d + r_ (n 



(n+l)- 



r k d k 



1, we obtain \U 



n+l I 



Hence if we assume \U n \ = d n (^2 k 
^HEL-n r/)-rf+d+r_ (n+1) = EL„ (t!+ i) r/-l(= [d^H\- 
1). In view of the definitions of and t™, combining this discussion 
with equations (5.1) and (5.2), we get the statement of the lemma. 

□ 

To pass from limit theorems for the Markov chain to those for the 



ergodic integral, we need to combine Lemma |5.3| with the following 
simple observation. 

Proposition 5.4. Let x[ n ^ and X 2 , n G N, be two sequences of 
random variables on a probability space (Q, P) satisfying the conditions: 

(1) there exists a constant K such that for all n we have the in- 
equality \Xi — lj | < K almost surely; 

(2) Var(X { 2 
Then 



— > oo as n — > oo. 



Var(X^) 



lim 
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Proof. First, note that we only need to consider the case when EX[ n> = 
EX^ ] = 0, since if x[ n ^ and -X^ satisfy the assumptions of the Propo- 
sition, theasodoX[ n) -EX l (n) and X^ n) -EX^ n) . HEX[ n) = EX^ n) = 0, 
then we have 

|E(x{ n) ) 2 - E(Z 2 (n) ) 2 | < K (K + E|x{ n) |) <k(k+ ^Var(X^ 

by the Cauchy-Buniakovsky-Schwarz inequality, and the proof is com- 
plete. □ 

6. Markov approximation and the proof of Theorem 11.21 

The following construction plays a basic role in the sequel. 

6.1. Markov chains. Let t G Let n be the unique integer such 
that d n °t G [1; d) and set t = ^2 k<no T k d k . For now, we work with 
n = 1. Let (Q, J 7 , P) be a probability space. We construct a random 
variable I on Q valued in A d with distribution 

P(I = W) = v{{x eY:u%---u% = W}). 

We define a map g* : A d -> R by g {W) = f{W 1 ■ ■ ■ W fct _i)- Let now 
( m n)n>i be a sequence of iid random variables uniformly distributed in 
{1, . . . , d} independent of /. We define a process valued in £ = A x A 2 
by setting X = (J 1( I k I k+ i) and recursively X n+1 = (a n+1 , V n+l ) to be 
the end vertex of the edge labelled m n of the automaton A T _ n starting 
from X n = (a n , V n ). Finally, we set 



(6.1) m n ) = f(S an ' m ") + /(P^+-«). 

Lemma 6.1. The sequence (X n ) n > is a Markov chain. 
We have the following commutative diagram 

gN ^ gN 

Y 4 Y 

where ©t is the shift on the Markov chain. The probability measure P 
invariant by the Markov chain projects to v. 

Proof. (X n ) n > is a Markov chain since (a n+ i, V n+ i) only depends on 
(a n ,V n ). Moreover, there is a projection it between infinite paths of 
(X n ) and Y. This map consists in forgetting the second variable V n . 
Formally, the map is defined by : m n = \P*\ + 1. To check that the 
image measure is v under the projection, it suffices to observe that 
^2vgA 2 P(Xo — (O) V)) — v({x eY:Uq= a}) which is obvious. □ 
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We claim that 
Lemma 6.2. 

n 

f(U n ) = g (X ) + J29k(X k ,m k ). 

k=l 

Proof. It is a straightforward computation. Check that f(U°) = g (X ). 
Then write recursively U n+1 = S^^ n+1 ^a(U n )V" ■ ■ ■ V™_ n+m _ v Hence 

f(u n+1 ) -/(c/-) = f(sz in+1) ) + f(v 1 n ---v^_ n+m _ 1 ). n □ 

Now, we assume that t has a periodic expansion thus the Markov chain 
is homogeneous. To simplify notations, we will forget the dependence 
in t. Up to a change of basis, we assume that the expansion of t has 
period 1, all the digits are equal, i.e. for all k, r k = r. 

6.2. Proof of Theorem D 

The Markov chain (X n ) is endowed with an initial probability measure 
fiQ defined on A x A 2 and a transition matrix Q. 

Lemma 6.3. Let B = {V E A 2 , v{[B}) > 0}. The Markov chain (V n ) 
is recurrent on A x B. 

Proof. The proof is straightforward since the substitution is primitive. 

□ 

Observe that the processes (a n ) and (V n ) are also Markov chains. Let 
jx be an invariant measure for the Markov chain (X n ). 

Lemma 6.4. The initial measure liq of the Markov chain (X n ) projects 
on the first coordinates on the invariant measure for the Markov chain 
(a n ) and on the last coordinate on the invariant measure for the Markov 
chain (V n ). 

Proof. The initial measure satisfies: 

Li (a, m, V) = P{h = a, h t i kt+1 = V)P(m = m). 

It projects on /ig(a) = v([a]) and on Hq(V) = u([V}). These measures 
are obviously invariant under the Markov chains. □ 

An important lemma is the following: 

Lemma 6.5. For every integer n and for every probability measure p, 
invariant by the chain (X n ), we have E[g n (X n )] = and Ep i [g (X )] = 
0. 
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Proof. We recall that E V [L 1 foh s (x)ds] = since E[f] =0 by hypoth- 
esis. This does not a priori imply the conclusion of the Lemma since 
the Markov chain (X n ) is not always recurrent. We must exclude the 
possibility the averages on different ergodic components compensate 
each other. 



By Lemma [6 .2[ there exists a constant K such that, independently on 

n, 

n 

(6.2) E[Y^9k(X k ,m k )]<K. 

k=l 

Since expectation is a linear operator, and in view of the definition of 
g n (recall equation ( 6.1[ )), we can decompose 



(6.3) E[g n (X n ,m)]=E[g 1 n (a n ,m)]+E[g 2 n (V n ,m)]. 
Let fl be any invariant measure under (X n ). We have: 

Eji[go(X ,m)} = Ep[g n {X n: m)\ = E^g^a^m)} + Ep[gl{V n ,m)\. 



By Lemma 6.4 and equation (6.3) this yields 



Ep[go{Xo,m)\ = E[g l n (a n ,m)] + E\g*(y n ,m)] = E[g n (X n ,m)}. 



Therefore, E[g n (X n , m)] is independent on n. It follows from (6.2) that 
E[g n (X n ,m)) = and E^[g (X ] = 0. 

□ 

We just proved that for every recurrent class the mean of the invariant 
measure supported by this class is equal to zero. Thus the central 
limit theorem for stationary Markov chains implies the random variable 
77^ Sfc=i 9k{Xk, TUk) has a limiting distribution. The contribution of 
each recurrent class is a normal law. 

Remark 6.6. Observe that, as a consequence, for every positive e, 

1 n 

-^—y2gk(x k ,m k ) 

n ~ 2 k=i 
converges in distribution to 0. 



To finish the proof of Theorem 1.2, it is enough to show: 



1 [ dnt (d) 1 n 

(6.4) lim —= I foh s {x)ds= lim —= }g k (X k , m. ) 

V Jl> V k=l 



In other words : 
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Proposition 6.7. For all bounded continous tp : R — > R ; 



1 f dnt 

lim / 93 ( -j= j f o /i s c/s ) 1/ = lim E 



(^^^9k(X k ,m k )\ 



Proof. By Lemmas 5.3 and 6.2, the quantities J Q / o h s (&~ n x)ds and 



Sfe=i 9k(Xk, mk) are equal up to a bounded term. As we divide by 
y/n a bounded term is negligible. Thus the limit distribution of the 
sequences 

7= / / o h s (&~ n x)ds and -= VV(X fc , m k ) 
In In Jn ^— ' 

1/0 v fc=i 



are the same. 



By Lemma 5.1 the law of 

rd n t 

f o h s (x)ds 

Jo 

is equal to the law of 

-d n t 

f o h s (e~ n x)ds. 



This proves the proposition. □ 



Together with Proposition 5.4, this implies Theorem 1.2 



7. Proof of Theorem 11.91 

7.1. Assumptions on the Markov chain. In the beginning of Sec- 
tion [5j to a substitution a we assigned a family of automata {Ai, i G 
{1, . . . , } in such a way that to any t G [0, 1] with decomposition in 
basis d (set A — {0, . . . , d — 1}) given by t = J2 n >o T nd~ n we associate 
a sequence of automata in the family, A TQ , A Tl , .... To such t we also 
associate a sequence of functions (g\) defined on the state space E of 
the underlying Markov chain (g k = g Tk ). When t has a periodic ex- 
pansion in basis d, the sequence of automata is periodic and hence the 
associated Markov chain is a standard homogeneous Markov chain. 
Here we assume that there is such a to for which the Markov chain is 
aperiodic and has positive variance. Let us first explain the meaning 
of this assumption. We stress that it is not only an assumption on 
the Markov chain but also on the functions {g\). Let x — Xi'"Xp 
denote a word in A p such that to = X]n<o(Sf=i Xid~ l )d~ np . First of 
all, we assume that the matrix of the Markov chain associated with t 
is primitive (without loss of generality, we can assume that the matrix 
has strictly positive entries). The positive variance asumption means 
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that the sequence of functions g to does not yield a coboundary. Hence 
there are two cycles on which it takes two different values. 
In slightly different words, it implies that there are two paths u and v 
in E p+l (with uq = vq and u p 



v p ) and some e > such that 



(7.1) 



k=l 



V 

£ 

fc=i 



(9xk( v k)) 



> e > 0, 



We claim that under this assumption, for Lebesgue almost all t, 
EVo(9l(X k ) - E[g{(X k )}) 



(7.2) 



N(0,1). 



As we shall see more precisely in Section 



6.2, convergence (7.2) implies Theorem 



7.4 



1.9 



by Lemma 5.3 and Lemma 



7.2. Choice of a full-measure set of admissible sequences. Ob- 
serve that the Lebesgue measure on [0, 1] maps onto the uniform prod- 
uct measure on A N by the map sending a real number to its expansion 
in basis d. Hence, with probability 1, the expansion of t contains in- 
finitely many occurrences of k = x 3 (we stress that we ask for three 
successive occurrences of x)- 

Denote (q n ) the sequence of occurrences of k in the expansion of t and 
m n = q n + 3p (recall p = \x\ = M/3 ; fix m = 0). The sequence 
(m n — m n _x) is a sequence of independent identically distributed ran- 
dom variables with geometric law. We claim that it does not grow too 
fast. 

We use Borel-Cantelli Lemma to say that the inequality B n = {m n — 
m n _i > n±} almost surely holds only for a finite number of n G N 

since En>i P ( 5 «) = E„>i c ^™ 1/4 < °o. Let J C [0,1] be the set 
of probability 1 on which the expansion of t contains infinitely many 
occurrences of k and B n is true a finite number of times. Observe that 
for all t G J, there is C (depending on t) such that for all fiGN, 



(7.3) 



m n — m„_i < Cn 4 . 



7.3. A Central Limit Theorem for non homogeneous Markov 
chains. 

7.3.1. The Dobrushin Theorem. We recall Dobrushin's central limit 
theorem for non-homogeneous Markov chains (see [Dob]). We follow 
the exposition by Sethuraman and Varhadan [SVJ. We only state the 
theorem for Markov chains defined on a finite state space, since that 
result is sufficient for our purposes. 
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For each n > 1, let {X^ : 1 < % < n} be n observations of a non- 
homogeneous Markov chain on a finite state space E with transition 
matrices {^^i : 1 < i < n — 1} and initial distribution 
Recall that, for discrete Markov chains, the Dobrushin's ergodic coef- 
ficient of a transition matrix tc is defined as a(7i) = 1 — 5(tt) where 



6(n) = sup |vr(a, c) 

a,b£E;c£E 



n(b,c)\. 



Recall the well-known inequality: 



(7.4) 



a{7r) > min 7r(a, c) 

a,c£E 



Moreover if 77 is a continuous function its oscillation is Osc(rj) = sup^ w> \r]( 
77(0/) | . If 77 is a function defined on the states of a Markov chain, we 
have the property : Osc(nr]) < S(7i)Osc(rj). 

Let a n = mini<j< ra _i a^-™^). Let {/j : 1 < i < n} be real valued 
functions on E such that there exists some finite constants C„ with 



sup 1<i<n sup xeX I if \x^ n) )\ < G, 



Define, for n > 1, the sum 



i=l 



Theorem 7.1 (Theorem 1.1 in [SVJ). // 



lim Cla J 



J2v(ft\xt ] )) 



then, we have the standard normal convergence 

S n — E[S n ] 



N(0,1) 



7.3.2. Decomposition for mixing. Let now t G J be fixed. Most quan- 
tities depend on t but, from now on, we omit the sub/superscript. Let 
(Tn)n>o be the expansion of t. We denote p n the transition matrix as- 
sociated with the automaton A Tn and p„ jm = p n ■ ■ -p m -i the transition 
matrix between times n and m. For all n consider the word W n = 
r m „_j • • • T mn _i. This yields a decomposition of our sequence in words 
ending with k = XXX- We define the (non homogeneous) Markov chain 
Y n = X mn _ x . We denote by 7r n its transition matrices, 7r n = p mn _ 1)mn - 
The function we want to control is a function of (X mn _ x , . . . ,X mn _i). 
Let us set Y n = (X mn _ 1+ i, . . . ,Jf mn _i). Observe that, conditionally 
to Y n and Y n+ i, Y n is independent of {Y iy i e N \ {n,n + 1}} (and of 
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{Yi,i GN \ {n}}). We introduce 

m„— 1 m n —l 

f n (Y n ,Y n ) = 9k(X k )-E[ ]T 9k{X k )}. 

k=m n -i k=rn n - 1 

Remark 7.2. Ideally we would like to use Dobrushin's result about 
sums of observables of a non stationary Markov chain. The difficulty 
is that the functions do not depend only on the Markov chain itself. 
We could represent this quantity as a function of Y n , Y n+1 and another 
random variable independent of the chain. Still it is not enough to 
apply the theorem in its standard versions. The usual way to avoid 
this problem would be to build a new chain joining coordinates by pairs 
((Y n , Y n+ x))] but this construction breaks down the lower bound for the 
ergodic coefficient. This does not change the result fundamentally but 
we have to explain how the proof has to be adapted. 

We state the main results of this section as follows. We denote F n = 
f n (Y n , Y n ) and S n = Y^k=i ^k- The positive variance assumption allows 
to show that, for all t E J, 

Lemma 7.3. There is 7 > such that 
(7.5) V (S n ) > 7 n. 



The proof is given in Section 7.7 Relying on this result, we show that 
Proposition 7.4. For all t G J, 

> N(0, 1) 



The remainder of this section is devoted to the proof of Proposition 



7.4 We are going to adapt Dobrushin's Central Limit Theorem for 
inhomogeneous Markov chain (following the exposition by Sethuraman 
and Varhadan |SVj ) to our special case. But we first show how it yields 
Theorem 11.91 



7.4. Proof of Theorem \1.9\ We are now in position to prove Theo- 
rem H"9l Let us fix t G J. 



Lemma 5.3| and Lemma GJ2 show that we can apply Proposition ^4 



to get eq uiva lence of the variances V n introduced in the statement of 
Theorem 1.9 and V(Y^k=o 9k(Xk)))- Since the g\ belong to a finite 
family of bounded functions, we easily get an upper bound for the 
variances: V(Y^lZo fliP^fc)) — su Pfc>o I Idll \oo n < a 2n ■ For the lower 



bound, we have to be more careful. On the one hand, Lemma 7.3 



shows that there is 7 > such that V (S n ) > 'yn. On the other 
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hand, we use the relationship (7.3) between n and m n to observe that 
m n < n.C.n 1 ^ < Cn 5 / 4 . Hence V^^' 1 g{{X k ))) > 7 (m n /C) 4/5 and 
(for m n <m < m n+1 ), V^Zo 9kW)) > l{m n /C)^ > 7 ((m - 
m 1 / 4 )/C) 4//5 > «in 4//5 . This proves statement (i). 



We use again (7.3) and boundedness of the family (gl) to check that 
for m n <m< m n+1 , \ YlT^o 9k(Xk))) — S n \ < const, n 1 / 4 . In view of 



the variance lower bound, Proposition 7.4 yields 



(76) ESfaiW-flbiW]) , mi) 



Lemma |5.3| a nd Lemma 6.2 show that (7.6) yields statement (ii) of 
Theorem 11.91 

We conclude recalling that J has full Lebesgue measure. □ 

7.5. Standard assumptions. Even though our situation is very sim- 
ilar, we cannot apply directly Dobrushin's Theorem because our sum 
runs on observables that are not functions of the state of the (mixing) 
Markov chain. Nonetheless, our argument follows the general outline 
of |SV] . We use the same interplay between boundedness, rate of mix- 
ing and variance. More specifically, in our case, the ergodic coefficient 
is constant (bounded away from ; so we need just to ensure that 
Cl/V(S n )^0). 

7.5.1. Upper bound. The upper bound 

C n = sup sup fi(z,z) < sup(||^||oo(mi + i - mi)) < Ci.n l/i . 

i<n z,z i<n 

In other words, 

C n = sup Fi < Cm 1 '*. 

i<n 

7.5.2. Ergodic coefficient. The ergodicity coefficient is bounded from 
below uniformly because of the presence of the word k at the end 
which guarantees some mixing. Let us be more specific. 



By inequality (7.4) since all the words W n end with the word k (asso- 
ciated to a (power of a) primitive matrix), p q „, mn is uniformly bounded 
from below (by a constant a), and we have 

n n (a,b) > ^p mn _ 1>qn {a,c)p qnjmn {c,b) > ^Pm„_i, g „0, c) a > a. 

From which we deduce that a(vr n ) > a uniformly in n. This holds for 
the Markov chain (Y n ) n > . 
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Remark 7.5. We stress a technical difference with |SV] . The point 
is that, as in [SVj, we have Osc(E[f(Yj, Yj)^) < a J '- i Osc(/(F i )) ; 
but that we will need a bound on Osc(E[fj(Yj,Yj)\Yi, %]) where Yj 
depends on what happens between m;_i and — 1. Since we need 
only asymptotic decay, i.e. for j — i large, we will simply shift by 1 
using that 

OsciElf^Y^Y]) < OsciEif^Y^Y^}). 



7.5.3. Variance. The variance of each elementary contribution is uni- 
formly bounded from below. Furthermore the sum of the variances 
of the first n terms is increasing rapidly enough with n. In the usual 
setting the last statement follows from the first one using assumption 
2. In our case, it seems more convenient to prove directly the second 



statement. That is the object of Lemma 7J3 which claims that there is 
7 > such that V (S n ) > yn and whose proof is postponned to Section 

7.5.4. Outline of the argument. The crucial point for the proof (fol- 
lowing |SVj ) is the interplay between these three quantities. Roughly 
speaking the variance must grow fast enough to kill unboundedness and 
lack of ergodicity, as expressed by equation (1.3) in |SVj . In our case 
the ergodic coefficient is independent on n so it is enough to have the 
variance growing faster than the upper bound. To draw the parallel let 
us write : 

lim C 2 n a-*V~ l = lim C 2 n 1/2 p- 3 — = 0. 

But we stress again that the situation is slightly more tricky because 
the observable we are interested in is not a simple function of the 
states of the Markov chain for which the assumptions are fullfilled (f n 
depends on Y n , but also on Y n ). That is why we treat the variance 
separately. For the remainder of the proof, we can not just quote |SV] 
but it appears that we can follow exactly the same lines. To construct 
the Martingale approximant, we take Y n into account. To prove that 
the assumptions of the main Martingale differences CLT arguments are 
fulfilled, we use the same ideas based on the control of the oscillations 
of the conditionnal expectations of f n (Y n , Y n ) which rely essentially on 
the summability of the long range correlations and on the sublinearity 
of the upper bound (together with the linear growth of the variance). 



7.6. Proof of Proposition 7.4[ The central idea is to use the follow- 
ing standard CLT (implied by Corollary 3.1 in [HHJ) for martingale 
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differences : if = Xw =2 £^ * s a martingale with respect to a 

filtration T k and if, 

(7.7) max Halloo ^ 0, and, 

l<i<n 



(7.8) ^^m^-^linL 2 , 

i=i 

then, 

Mi n) N(0,1). 

The quantities whose fluctuations we study are not a martingale mainly 
because increments are not independent for a Markov chain in that 
they depend on the present state. It is rather standard to bypass this 
difficulty by introducing 



(7.9) = Y,m^Yi)\Y k ,Y k ] 

i=k 



Then considering the scaled differences 



An) 



we define the (family of) processes 

M t ] = Et 2 d n) and observe that 
the processes (M^)i< k < n are martingales with respect to the nitrations 
{J^k, 1 < k < n}, where T k = a{Yi,Yi : 1 < I < k}. Indeed, since £^ 
is J-fe-measurable, for k < n — 1, 

i=2 

= Mi n) + (VVm)-\E \zl% - E[zi% I Y k , Y k \ | ? k ] ) 



= Mi n) + (VWn))- l (E[Z^ k ] - E\ztl | Y k , Y k ]\F k \) 



= M k 



(n) 
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Observing that 



E[z^ x \Y k ,Y k \ = e[J2 ^[/<(^^)in+i,n+i]in,n] 

i=k+l 
n 

i=k+l 
n 

= Yl E\fi(Yi,Yj\Y k ,Y k ], 



i=k+l 



we can put (7.9) the other way round to obtain, for k < n — 1, 
fk(Yk, Y k ) = Z^ — E[Z k ^ 1 | Y" fc , Y k 



,(n) 



for k — n, f n (Y n , Y n ) — Z\' . We obtain, for the sums we are interested 



m: 

n 



fc=2 

and for their variances (using the orthogonality of the martingale in- 
crements): 

n 

(7.10) V(S n ) = Y,V (4 n) - E[Z { k n) I n-i]) + v(z< n) . 

fc=2 

Hence, we can approximate £„/ a/^(5 , „) by Afi"' (see Step 5.) and use 
the above argument to conclude, provided we check both assumptions 



(7.7) and (7.8) (as well as approximation). 
To do so, we will follow the proof of |SVj . 

Step 1. We prove inequalities (playing the role of Lemma 3.1 in |S Vj ) . 
We will make repeated use of the property : Osc(itt]) < 5(ir)Osc(r)). 
Let 1 < i < j ' < n. As ||oo < C m its oscillation Osc(Fj) < 2C n . 



From the control we have on the mixing coefficient (see Remark 7.5 ) 



Osc(E[F^\^]) < Osc{F^)S(7r i+1 j) < 2C n (l - a)'"*" 1 . 

For centered random variables, HHoo is bounded by 20sc(), so that, for 
all 1 < i < j < n, 

(7.11) ll^fF^IU < 20sc(E[F^ n) \T l }) < const. n 1/4 (l 
The same arguments also yield the similar 

(7.12) Osc(E[F*\Fi}) < const. n 1/2 (l - a) j ~\ 



a) J -\ 
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Step 2. Putting together inequality (7.11) and the variance lower 
bound Lemma 7.3 (corresponding to Proposition 3.2 |SVJ), we obtain 
an analog of Lemma 3.2 |SV] . 



Using inequality (7.11) and EFi = 0, we see that 



l4 n) lloo < ^||£[ifV fc ]||oo < const. n 1/4 ]T(l -a) l - k < 2C n /a. 



i=k 



i=k 



Then by the variance lower bound (Lemma |7.3[ ), 

\\Z {n) \\ n 1 / 4 

|Zy fc Moo - 



(7.13) sup — ^=^= < const. , 

l<fc<n y/V(S n ) ay/fn 

which is, under our assumptions o(l). 

This will yield the ass ymp totic equivalence of S n / ^JV{S n ) and M^"' 
as well as negligibility (7.7) of the differences 

Step 3. Now let us set Hi = E[(^f ) ) 2 ]\J r i _i]. Before all, we observe 
that 



l + o(l) 



1=1 



i=l 



in view of ( |7.10[ ) and (7.13). In the next two steps, we are going to 
prove that 23I=i Hi ^ 1 in L 2 . This first step is rather general and 
correspond to Lemma 3.3 in |SV] . Observe that, since E[Y^ =1 Hi] = 
1 + o(l), the L 2 -norm 

n n n n 



i=l 



i=l 



i=l 



i=l 



tends to if and only if E[(Y^ =1 Hi) 2 ] — > 1. Following the proof of 
Lemma 3.3 in IS VI we write 



n-l 



mJ2 H ^ = E*W) + 2 E^( E ^ 



i=i 



i=l 



j=l 



and, 



rt-l 



i=l i=l i=l j'=i+l 

Taking the difference and recalling E\^™ =1 Hi] = 1 + o(l), we deduce 
that 

71 71 — 1 / 71 71 \ 



i=i 



8=1 



i=i+l 
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We observe that, since Hi is measurable with respect to J^-i, 

n n 

E[H t H i\ = E[HiE[ Hjl^]]- 

j=i+l j=i+l 

Hence, 

n n—l / n n \ 

l^[(E^) 2 ]"l| < [E[Y,H j \? i - 1 ]-E[Y l H j ] ]+o(l) 

i=l j=l \ j=i+l j=i+l / 

/n-1 \ 



< snp SC (E[( E 



Step 4. Hence it remains only to prove 



(7.14) ^ sup Osc(E[( E ^)|^_ 1 ]) = o(l). 

First we write 



Ki<n-1 
J=«+l 



jr'=i+l j=i+l 



j=i+l 



i-1 



and use the orthogonality property (martingale) of the increments 
(E[£ n) Zi n) \F u ] = provided r > s > u) : 

n n 

^[(E^')i^-i]) = s[( £ ej n) ) 2 i^-i]- 

j=i+l j=i+l 

Then we write all this in terms of Z and F. We recall that for A; < n — l, 
F k = Zl n) - E[Z^\T k \, while F n = Z„ n) . 

n n 
j=i+l j=i+l 

= V(S n )-\ ]T F 3 + E[Z^] - E[Z^\^]) + F n - E[Z^\JF n ^\) 

j=i+l 
n 

= v{s n y\Y. F i- E &t\n)- 

j=i+l 
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j=i+i 



j=i+i 

n 



j=i+l 

The last term is bounded by sup 2<i<n _ 1 V(S , n )~ 1 ||Z^ 1 ||^ = o(l); hence 
its oscillation is also uniformly o(l). For the first term, we write 

n 

Osc{V{S n y l E[{ Fif^i-x]) < ViSn)- 1 Osc{E[FjF m \JFi-\\). 

j=i+l i+l<j,m<n 



But, in the same spirit as for the proof of inequality (7.11), we have, 
for all 1 < / < i < j < n, 



Osc{E F^ ) E[Ff i) \J' i \\J r i 



< (1 -a) 



Osc(F^ n> E[Fj [Ti]) 
< (1 - a)'- 1 - 1 (Osc{F^)\\E[F^\Ti} 



+ \\F ( > n) \\ OQ Osc{E[F) n >\F l }) 



Ml 



< const. n 1/2 (l - a) l - l (l - a 



i-li 



It follows that, for alH < n, 

n 

Osc{E[{ H 3)\Fi-i\) < ^st. y(S n )-V/ 2 /a 2 = o(l). 
j=i+i 



Step 5. Conclusion. On the one hand, Step 2. shows that almost 
surely, 



S ,, 



< l|o ° =0 (1). 



On the other hand. Ste ps 1. and 2. show that Q7.7 ) holds while Steps 3. 
and 4. show that (Q holds so that il4 n) =4> iV(0, 1). This concludes 



the proof of Proposition |7.4| □ 



7.7. The variance. We end the proof by giving a direct proof for 
Lemma [7.31 about the lower bound of the variance. 



Step 1. 
i > 1, 
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To start with, we claim that there is 7 > such that, for all 



(7.15) 



E 



fi(Yi,Y^-E fi(Yi,Yi)\Yi,Yi 



i+1 



Yi, Yi + \ 



> 7- 



To simplify we write Fj = fi(Yi, Y%) and Gi = Fj — E[Fi\Yi, Y i+1 ] so that 
(7.15) becomes : 

E[G 2 l \Y,Y l+1 ]>e. 

To prove the claim, we fix i > 1. We set m = uii-i, M = rrii, p = \x\, 
q = Mi — 3p, r = M — 2p, s = M — p, and £ = M — m. We stress 
that with these notations, T q • • • r r _i = r r ■ ■ ■ t s _i = r s ■ ■ ■ tm-i = X an d 
recall that the matrix associated with \ is primitive. 

For w G E p+1 , we set £ w = {X r = wo,...,X s = w p }. Elementary 
considerations about Markov chains and primitivity of the matrices 
associated to x yield existence of e\ > and e 2 > such that, for all 
w e E p+1 , P(X r = w , X s =w p \ Y h Y i+1 ) > ei , P{S W | X r = w , X s = 
w p ) > e 2 . 



We also observe that ( 7. 1 ) shows that that there are two paths u and 
v in E p+1 (with u = v and u p = v p ) such that 



k=r 



k=r 



> e > 0. 



We observe that E[G\ \ Y i: Y i+1 ] > E[t £uUSv Gf \ Y { , Y i+1 ], so that, since 
£ u n £ v = 0, E\G\ | Yi, Y l+1 ] > E[leJ&i I Y u Y i+1 ] + E[l £v G] \ Y u Y l+1 }. 

By definition, 

E[Gl | Yi, Y i+1 ] = Gfci ■ ■ ■ zt) 2 P(Yi = z\Y h Y+i)- 

In the same language, 

Elle.^ | Y u Y l+1 ] > YI Gi(wuz) 2 P(X m ■■■X M = wuz\Y, Y i+1 ). 

w£E r ,z£EP 

Writing P(X m ■ ■ ■ X M = wuz\Y h Y i+1 ) = P(£ u \X r = u , X s = u p )P(X m ■ ■ 
wuo, X s • ■ ■ Xm = u p z\Yi, Y i+ i), we obtain 

E[G 2 \Y t ,Y l+1 ] > (G^wuz) 2 + Giiwvz) 2 ) 

w€E r ,z£EP 

x e 2 x P(X m ■ ■ ■ X r = wuo, X s --- X M = u p z\Yi, Y i+1 
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For w G E r and z G E p , observe that 



\Gi{wuz) — Gi(wvz)\ 



s-l 



> e, 



k=r k=r 

so that \G l (wuz)\ 2 + \Gi(wvz)\ 2 > e 2 (since a 2 + b 2 > a 2 + b 2 -2\a\ 
(|a| - |6|) 2 ). Hence 

E[G 2 \Y t ,Y l+1 ] > (e 2 /^2P(X m ---X r = wu ,X s ---X M = u p z\Y t ,Y l 



i+l) 



w&E r ,z&EP 
2 



> (e 2 /A)e 2 P(X r = u , X s = u p \Y i} Y i+1 ) 

> e 2 e 2 e 1 /A, 

and the claim is proved with 7 = e 2 e26i/4. 



fi(Yi,Yi) - E fifaYJlYiM 



Step 2. To fulfill the proof of Lemma 7.3, we prove that 

n— 1 n— 1 

i=0 i=0 

or, in other words, 

n—l n—1 



i+l 



V<2»>2>[Gf]. 



i=0 



i=0 



We recall that ER = 0. 



E 



E 



E 



E 



E 



5» : 

^^) 2 |F0,...,^n 



E 



E 



E 



E 



(y^Gi + J^EmnY^]] \Y ,...,Y r 
(j2 G i) +H+(^2E[E l \Y l ,Y t+l ] ) j \Y ,...,Y T 
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where the product term H is 
H = E 



E 



j2m\Yi,Y i+1 ]j , 



and, for all i, 

E[Gi\Y ,...,Y n ] 



E[F i -E[F i \Y h Y i+1 \\Y Q ,...,Y n \ 
E[F i \Y ,...,Y n ]-E[F i \Y i ,Y i+1 ] 
0. 



To conclude, we observe that, since for all i < j < k, E [GiGk \ Yj] = 
(by independence of the past and the future conditionally to the present 
for the Markov chain {X n )), 



E 



E 



5> 



Yq, . . . ,Y n 



E 



E 



Yq, ■ ■ ■ , Y n 



> E 



5> 

i 

J2 E i E [(Gi)) 2 l^o,---,^]] 

i 

Y^E{E{(Gi)f \Y„Y, +1 ]] 



where the last inequality is in debt to (7.15). □ 

8. Specific cases and examples 

To get an idea of the whole picture it seems useful to study what hap- 
pens in specific cases, making further assumptions on the substitution 
or/and on the possible choices of t. We also give concrete examples at 
the end of this section. 

8.1. Synchronization. We consider the case t = 1 and more generally 
t integer or say d n °t integer for some no, i.e. the expansion of t is 
finite. To study this simple case, we observe that (at least after a finite 
number of steps), we can work with a simplified automaton with state 
space A x A because when r k = we never need to know the second 
letter of V since m k + r k < d. 
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Lemma 8.1. When the expansion of t is finite, the subset of states 
{(a, a) ; a e A} is a recurent class of the (simplified) Markov chain. 
Moreover, on this class, the cocycle g is a coboundary. Instead of nor- 
mal fluctuations the limit law of lim^oo ^ * / o h s (x)ds is a Dirac 
mass at 0. 

Proof. We assume that = for all k large enough (larger than no). 
Thus, the Markov chain is stationary. In the automaton associated to 
r = 0, it is clear that a couple (a, a) is followed by a couple (6, b) since 
the positions of both letters are the same in a (a). Therefore the set 
{(a, a) ; a e A} is a recurrent class. 

If &~ n °x belongs to a and hd n ot{&~ no x) is in the same letter a then the 
integral along the unstable leave does not depend on the exact position 
of &~ n °x in a. Indeed shift of the piece of leaf along does not change 
the value of the integral (what is added from one side is subtracted 
from the other side). Moreover, since the value of = is zero, if we 
shift &~ n °x at the beginning of a the point &~ n °x to Vo((6~ n °a;) is 
also at the beginning of a. Thus the leaf from &~ n °x to hd"ot(&~ n °x) 
covers exactly a union of blocks of that scale. Thus, if we refine for all 
n > no, Jq 1 f(h s (o-~ n x))ds = °* f(h s (a~ n °x))ds. The value along 
the Markov chain can change (since it is only an approximation of the 
integrals, at each scale) but it must keep at bounded distance of the 
value of the integral. We claim that the sum of the values along any 
cycle of the recurrent class {(a, a) ; a e A} is zero. In fact, given 
a cycle of class, if the sum of the values of g along this cycle were 
non zero, we could iterate this cycle a large number of time and get 
arbitrarily large value for g. Thus g is a coboundary. The limit law 
of of lim^oo -7= f{f 1 f o h s (x)ds is therefore a Dirac mass at since 

Jo / h s {x)ds is bounded and we divide by \fn. □ 

Definition 8.2. We say that a letter is synchronizable if it appears at 
the same position in the image of two distinct letters. We say that a 
substitution a is synchronizable if for every couple of letters (b,c) there 
is a letter a that appears at the same position in a(b) and a(c). A 
substitution is strongly non synchronizable if it has no synchronizable 
letter. 

Proposition 8.3. If a substitution a is synchronizable, then the (sim- 
plified) Markov chain has a unique recurrent class, the set {(a, a) ; a G 
^4}. On this class, the cocycle g is a coboundary. Therefore the limit 
law o/linin^oo ^= J^ - 1 foh s (x)ds is a Dirac mass at when the expan- 
sion of t in base d is finite. 



Substitutions with eigenvalues of modulus 1 



41 



Proof. By Lemma 8^ , it is enough to prove that the simplified automa- 
ton has only one recurrent class. At each step of the Markov chain there 
is a positive probability to reach the recurrent class {{a, a) ; a G A}. 
Therefore the only recurrent class is the class {(a, a) ; a G A}. □ 

Lemma 8.4. If a has no synchronizable letters, then the recurent class 
{(a, a) ; a G A} of the (simplified) Markov chain is disconnected of the 
remainder of the graph. Hence it is reached only if reached at initial 
time. 

Proof. If b and c are different letters of the alphabet A, there is no 
arrow from the (b, c) to {(a, a) ; a G A}. Otherwise there would 
be a synchronizable letter. Therefore the class {{a, a) ; a G A} is 
disconnected from the remainder of the graph. □ 

In the last paragraph, Figure [2] gives an example of a synchronizable 
substitution on a 3 letters alphabet. Figure [3] gives an example of a 
substitution on a 2 letters alphabet which is non synchronizable. 

8.2. Strongly non synchronizable substitutions on a 2 letters 
alphabet. The following propositions give a recipe to construct strongly 
non synchronizable substitutions on a 2 letters alphabet. We also study 
the ergodic sums for these substitutions for some values of t. 

Proposition 8.5. If A = {a, b} a non synchronizable substitution o 
of length d on A with eigenvalue 1 is uniquely determined by the image 
of a (cr(b) is obtained from a(a) by exchanging a by b and vice versa). 

(k + 1 k \ 

Moreover the matrix M of a has the form M = ( ^ ^ ^ ] where 



k + 1 is the number of a in a(a) and d — 2k + 1. 



Proof. First of all, if a is strongly non synchronizable, for every j in 
{!,... ,d}, the letter number j in a(a) is different from the letter num- 
ber j in a(b). Therefore one of these letter is an a, the other one is 
a b. Thus a{b) obtained from cx(a) is equal to s(a(a)) where s is the 
substitution s(a) = b, s(b) = a. Let i be the number of a in o~(a) and k 
the number of b in a (a). We have k+£ = d. By the previous discussion, 

the matrix M has the form M = [ T „ ) . Since the matrix M has an 



k 

eigenvalue 1, £ — k + 1. □ 

Proposition 8.6. Let a be a substitution of length d on a 2 letters 
alphabet A = {a, b}. We assume that a is strongly non synchronizable 
and that every word of length 3 appears in a(a). Then the graph as- 
sociated to the automaton r = 1 is strongly connected and aperiodic. 
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Moreover g is not a coboundary on this graph. Thus the limit law for 



are 



lim^oo ^= f Q f o h s (x)ds is a normal law when the digits of t 
ultimately constant equal to 1. 

Remark 8.7. This gives a family of examples that satisfy the hypoth- 



esis of Theorem L9_ and hence a proof of Theorem 1.3 

Proof. To prove connectedness of the graph, we first prove that there 
is an arrow from (a, aa) to any other vertex. Let (a, fij) be a triple 
of letters in A. The word aft'j appears in a(a). We consider the word 
a(aa). In this word we see aWa[3^ where the length of W is equal to 
d. Thus the length between a and (3j is d + 1. By definition of the 
automaton, this means that we have an arrow from (a, aa) to (a,/37). 
Now, let us prove that from any triple (a, ftj) there is an arrow to 
(a, aa). If a = a and f3 = a the word aaa appears somewhere in <r(a). 
Thus, by definition of the automaton with r = 1, there is an arrow 
from (a, 07) to (a,aa). If a = a and (3 = b, the word abb appears in 
a (a). At the same position baa appears in a{b) since a is not strongly 
synchronizable. Thus there is an arrow from (a, 67) to (a, aa). If a = b, 
(3 = a, the same reasonning holds. If a = (3 = b, then aaa appears in 
o~(b). Thus, by the same argument, there is an arrow from (0,67) to 
(a, aa). This proves that the graph is strongly connected. 
In fact, we also proved that it is aperiodic since there is an arrow from 
(a, a) to itself (thus a cycle of length 1). 

We prove now that g is not a coboundary. We show that the value of g 
on some arrow from (a, aa) to itself is different from which is enough. 
Let aaa be a subword of a(a). We have a(aa) = PaaaSPaaaS where 
the length of SP is d — 3. We consider the arrow from (a, aa) to itself 
corresponding to m — \P\ + 1. The label of this arrow is the number 
v aa spa- The word aaSPa is a reordering of PaaaS. Thus, the numbers 
VaaSPa and vp aaa s coincide. But vp aaa s = v^ a) = v a . One can check 

that an eigenvector associated to the eigenvalue 1 of M is ( \ ) ■ Thus 



is different from 0. 



We now apply Theorem L2 As there is only one recurrent class which 
is aperiodic, the limit distribution is a normal law. □ 

8.3. Examples. In this paragraph, we construct explicit examples with 
various properties: 

Figure [2] represents the automatom for 



a 





Figure 2. 

and r = 0. The synchronization property is satisfied. The graph 
has one recurrent component but is not strongly connected (there are 
transient components). The eigenfunction / is not a coboundary nev- 
ertheless, when the 2-adic expansion of t is finite, the cocycle g is a 
coboundary. 

Figure [3] represents the automaton for 

fl -> 112 
a : \2 -»■ 221 

and r = 0. The graph has 2 recurrent components: the component 
{(a, a) ; a G .4.} where the cocycle is a coboundary, another component 
where the Markov chain has non zero variance. Thus, the limiting 
distribution is a superposition of a Dirac at and a normal distribution. 
This means that starting from a set of positive measure there is no 
fluctuation at the scale on another set of positive measure there 
are fluctuations at the same scale. 
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Figure 3. 



Figure [4] represents the automaton for 

fl ->■ 112 
a : \2 -»■ 221 

and r = 1. The graph is strongly connected, aperiodic, the variance is 
positive thus the limiting distribution is a normal distribution. 
Figure [5] represents the automaton for 

fl ->■ 112 
a: \2 -»■ 221 

and r = 2. 

The graph contains two connected components. On one component the 
cocycle is a coboundary. We have no explanation for this phenomenon. 
On the other one, there is a positive variance. 
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